Deductive Reasoning

Robert J. Sternberg, Martin J. Guyote, and Margaret E. Turner

Yale University

During the past several years, the senior investigator has been attempting
to develop a unified theory of human reasoning. This research has proceeded
along two major fronts, one involving the formulation of a subtheory of
inductive reasoning, the other involving the formulation of a subtheory of
deductive reasoning. We discuss here the work we have done on deduction.

The theory of deductive reasoning is not yet completely formulated or
tested, but work on the theory is far enough along to merit a progress report.
So far, we have formulated and tested models of deduction for the three main
kinds of syllogisms that have been investigated by students of human reasoning:
categorical, conditional, and linear syllogisms. We summarize the theory and
data for each of the three kinds of syllogisms below. Then we draw some
conclusions and mention the directions in which our current research is going.


The Nature of Categorical Syllogisms

A categorical syllogism comprises three declarative statements, each of
which describes a relation between two sets of items. The first two
statements, called the major and minor premise respectively, are givens. The
third statement, called the conclusion, follows with logical necessity from the
premises. Categorical syllogisms are of two basic types. In the first type,
both the major and minor premises express relations between two sets of
objects, one of which overlaps between premises. The conclusion expresses a
relation between the nonoverlapping sets of objects. An example of such a
syllogism is "All B are C. All A are B. Therefore, all A are C." In the
second type of syllogism, the major premise expresses a relation between
two sets of objects, and the minor premise expresses a relation between a
particular item and one of the two sets of objects. The conclusion expresses a
relation between that item and the other set. An example of such a
syllogism is "All A are B. X is an A. Therefore, X is a B." We will consider
in this part of the article only the first, more widely studied type of
syllogism.

The Transitive-Chain Theory of Categorical Syllogistic Reasoning

Representation of information. Figure 1 names the five possible set
relations, and shows how these relations are represented in both conventional
Euler-diagram format and in the symbolic format we propose. Each symbolic
representation consists of two distinct components, one (at the left) indicating
how many members of Set A are also members of Set B, the other (at the right)
indicating how many members of Set B are also members of Set A. In this
notation, lowercase letters stand for disjoint, exhaustive partitions of a set.
Thus, for example, lowercase a1 and a2 are mutually exclusive and exhaustive
with respect to Set A. Uppercase letters refer to whole sets, and the arrow
relation indicates that the partition to the left of the arrow is a proper
subset of the set to the right. Components can be referred to by the order
of the terms within them. Thus, all left-hand components in the table are AB
components, and all right-hand components are BA components.

Insert Figure 1 about here

Let's consider a couple of examples to see how the notation works.
Consider first set equivalence (identity). Note that both partitions of A, a1
and a2, are proper subsets of B, and both partitions of B, b1 and b2, are
proper subsets of A. Thus, all a's are B's and all b's are A's, as is the case
for set equivalence. Consider now the second set relation, set-superset. Notice
that although all a's are B's, only some b's are A's: In the component at the
right, b2 is a proper subset of not A, rather than of A. This relation, then,
indicates that B is a superset of A. To summarize, the basic idea is that each
set relation can be represented by a notation indicating the relative number
of a's that are B's and b's that are A's.
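A minimal sketch of this notation as a data structure may make it concrete.
Figure 1 is not reproduced here, so only the entries for equivalence and
set-superset follow the description above; the entries for set-subset, overlap,
and disjoint are assumptions made by analogy, and the names SET_RELATIONS and
all_are are hypothetical.

```python
# Each set relation is a pair of components (AB, BA). A component is a pair
# of booleans, one per partition: True means the partition is a proper subset
# of the other set, False means it is a proper subset of its complement.
SET_RELATIONS = {
    # a1->B, a2->B ; b1->A, b2->A (described in the text)
    "equivalence": ((True, True), (True, True)),
    # a1->B, a2->B ; b1->A, b2->not A (described in the text)
    "set-superset": ((True, True), (True, False)),
    # The three entries below are assumed, not taken from the text.
    "set-subset": ((True, False), (True, True)),
    "overlap": ((True, False), (True, False)),
    "disjoint": ((False, False), (False, False)),
}


def all_are(component):
    """True if every partition lies inside the other set ("All X are Y")."""
    return all(component)


ab, ba = SET_RELATIONS["set-superset"]
# All a's are B's, but not all b's are A's.
print(all_are(ab), all_are(ba))  # True False
```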

Combination of representations. Figure 2 shows how two simple inferential
rules can be applied to the symbolic representations of set relations to
effect the combination of any two representations. The proposed representation
has the advantage of permitting combination to occur via the two rules. None
of the alternative theories of syllogistic reasoning that have been proposed
specifies comparable rules by which Euler diagrams or other forms of
representation can be combined.

Insert Figure 2 about here

The first rule states that if a partition x_i is a proper subset of Y and
a partition y_j (where j may but need not equal i) is a proper subset of Z,
then x_i is a proper subset of Z. This rule applies when the two middle terms
match in polarity, that is, are both affirmative. It is from this rule that
the transitive-chain theory derives its name, since elements are combined
by forming simple transitive chains.

The second rule states that if a partition x_i is a proper subset of not
Y and a partition y_j (where j may but need not equal i) is a proper subset
of Z, then x_i may be a proper subset of either Z or not Z; one can't tell
for sure. This rule applies when the two middle terms do not match in
polarity, that is, the first is negative and the second affirmative. In this
case, one cannot form a transitive chain.
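The two rules can be sketched as a small function over components. This is an
illustration under stated assumptions rather than the authors' procedure: a
component is encoded as a tuple of booleans (True for "proper subset of the
set", False for "proper subset of its complement"), the names link and combine
are hypothetical, and cases the two rules do not cover (a middle-term partition
that is a subset of not Z) are deliberately left unhandled.

```python
from itertools import product


def link(x_in_y, y_in_z):
    """Possible polarities of x_i with respect to Z.

    {True} means x_i is a proper subset of Z; {True, False} means it may be
    a proper subset of either Z or not Z.
    """
    if x_in_y and y_in_z:
        return {True}  # Rule 1: a transitive chain x_i -> Y, y_j -> Z
    if not x_in_y and y_in_z:
        return {True, False}  # Rule 2: no chain can be formed
    # Not described by either rule as stated above.
    raise ValueError("case not covered by the two rules")


def combine(xy, yz):
    """Combine an XY component with a YZ component into candidate XZ components."""
    outcomes = []
    for x_in_y in xy:
        possible = set()
        for y_in_z in yz:  # j may but need not equal i
            possible |= link(x_in_y, y_in_z)
        outcomes.append(sorted(possible, reverse=True))
    # One candidate component per way of resolving the uncertain partitions.
    return [tuple(choice) for choice in product(*outcomes)]


# x1 -> Y, x2 -> not Y, combined with y1 -> Z, y2 -> Z
print(combine((True, False), (True, True)))
# [(True, True), (True, False)]
```

Redundant results (the same link reached through different middle partitions)
collapse because outcomes are collected in a set.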

Consider an example in which these two rules are applied to combining two
representations, one in which B is a subset of A and the other in which C is
a superset of B. There are two ways in which transitive chains might be formed
from the two sets of components: first by combining AB with BC (since the middle
terms match), and second by combining CB with BA (again since the middle terms
match).

First, let's combine the AB component with the BC component. Rule 1 can
be applied twice: We can form a first transitive chain by linking a1 to B
with b1 to C, yielding a1 to C; we can form a second transitive chain by
linking a1 to B with b2 to C, again yielding a1 to C. We write the two a1 to
C relations in the first row to the right of the double arrow. Rule 2 can
also be applied twice, since we are unable to form a transitive chain from
a2 to C via either b1 or b2. Rather than writing two redundant rows to the
right of the double arrow, we simply write the result once: a2 can be linked
to either C or not C.

Next, let's combine CB with BA. Through Rule 1, the c1 partition can
be linked to A through either b1 or b2; the two c1 to A relations are indicated
at the right of the double arrow. Through Rule 2, we find that c2 can be linked
to either A or not A, also as indicated at the right of the double arrow. We
have now completed the combination process, ending up with two AC and two CA
representations.

There's just one more step left. You'll remember that each original
representation consisted of an AB component and a BA component. Similarly,
each final representation must consist of an AC component and a CA component.
But our representations as they now stand consist of either two AC or two CA
components. Our final step, therefore, is to rearrange the components into
canonical form. There are four ways in which this rearrangement can be
realized: by combining AC1 with CA1, AC1 with CA2, AC2 with CA1, or AC2 with
CA2. In this particular example, each of these rearrangements yields a unique
final representation, although this need not be true in general. Note that
by using the two simple rules of inference, we have discovered four possible
final representations that can result from combination of the two original
ones: C and A equivalent, C superset of A, C subset of A, and C and A
overlapping.
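The rearrangement step amounts to pairing every AC component with every CA
component. The sketch below takes the two AC and two CA components from the
worked example as given; the boolean encoding, the lookup table NAMES, and the
mapping of component pairs to relation names are assumptions made for
illustration, since Figure 1 is not reproduced here.

```python
from itertools import product

# (AC component, CA component) -> name of the resulting set relation.
# True means "proper subset of the set", False "proper subset of its complement".
NAMES = {
    ((True, True), (True, True)): "C and A equivalent",
    ((True, True), (True, False)): "C superset of A",
    ((True, False), (True, True)): "C subset of A",
    ((True, False), (True, False)): "C and A overlapping",
}

# Components produced by the combination step in the worked example.
ac_components = [(True, True), (True, False)]  # a1->C, a2->C ; a1->C, a2->not C
ca_components = [(True, True), (True, False)]  # c1->A, c2->A ; c1->A, c2->not A

# Pair AC1 with CA1, AC1 with CA2, AC2 with CA1, and AC2 with CA2.
final = [NAMES[(ac, ca)] for ac, ca in product(ac_components, ca_components)]
for name in final:
    print(name)
```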

Information-processing model. The description of the transitive-chain
theory up to now has been for the ideal subject, one who can process
information without making errors. Subjects do make errors, of course, and the
transitive-chain theory specifies the processes that give rise to these errors.

In the transitive-chain theory, as in other theories of syllogistic
reasoning, there are four basic stages of processing: encoding, during which
the premises are read and interpreted; combination, during which information
from the premises is integrated; comparison, during which the combined
representation is compared to possible labels for the representation (such as
"All A are C" and "Some A are C"); and response, during which the subject
communicates a response. According to the transitive-chain theory, encoding and
response are error-free. Erroneous responses result from errors made in
combination and comparison.

Errors during the combination stage arise from limitations in the ability
of working memory to hold all possible combinations. A standard classical
syllogism can require as few as one or as many as sixteen pairs of set relations
to be combined. For example, in the syllogism "No B are C. No A are B," each
premise can be represented by only one set relation, meaning that only one
combination need be performed. In the syllogism "Some B are C. Some A are
B," however, each premise can be represented by four set relations, meaning
that sixteen (four times four) combinations need to be performed.
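The arithmetic behind these counts is a simple product, sketched below. The
counts for "no" (one) and "some" (four) come from the paragraph above, and the
count for "all" (two) is stated elsewhere in the text; the count of three for
"some ... not" is the usual Euler-diagram count and is an assumption here, as
are the names N_RELATIONS and n_combinations.

```python
# Number of set relations that can represent each kind of premise.
N_RELATIONS = {
    "all": 2,       # equivalence, set-superset
    "no": 1,        # disjoint
    "some": 4,      # equivalence, set-superset, set-subset, overlap
    "some_not": 3,  # assumed; not stated in the text
}


def n_combinations(major, minor):
    """Pairs of set relations an error-free subject would have to combine."""
    return N_RELATIONS[major] * N_RELATIONS[minor]


print(n_combinations("no", "no"))      # 1
print(n_combinations("some", "some"))  # 16
```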

According to the theory, subjects combine a maximum of four pairs of set
relations. Moreover, there is a three-tier preference hierarchy that places
some constraints on the order in which set relations are combined. In
particular, equivalence relations are combined before non-equivalent
symmetrical ones (overlap and disjoint sets), which in turn are combined before
asymmetrical ones (set-superset and set-subset). This ordering reflects the
ease with which relations of each kind are stored and manipulated in working
memory. Symmetrical relations are those for which the polarities of the
elements of the left-hand side of each component match the polarities of the
elements of the right-hand side of each component. A quick glance back at
Figure 1 will reveal symmetry of polarities only for equivalence, overlap, and
disjoint relations. Four parameters of information processing arise from the
combination stage (p1, p2, p3, and p4), representing the respective
probabilities that exactly 1, 2, 3, or 4 pairs of set relations are combined.
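The three tiers can be derived mechanically from the symbolic representations.
The sketch below is one reading of the symmetry criterion (the left-hand and
right-hand components show the same pattern of polarities); the table of
representations is partly assumed because Figure 1 is not reproduced here, and
the names RELATIONS, is_symmetrical, and tier are hypothetical.

```python
# (AB component, BA component); True = subset of the set, False = of its complement.
# Only equivalence and set-superset are described in the text; the rest are assumed.
RELATIONS = {
    "equivalence": ((True, True), (True, True)),
    "set-superset": ((True, True), (True, False)),
    "set-subset": ((True, False), (True, True)),
    "overlap": ((True, False), (True, False)),
    "disjoint": ((False, False), (False, False)),
}


def is_symmetrical(relation):
    """Polarities of the left-hand component match those of the right-hand one."""
    left, right = relation
    return left == right


def tier(name):
    """0 = equivalence, 1 = other symmetrical relations, 2 = asymmetrical relations."""
    if name == "equivalence":
        return 0
    return 1 if is_symmetrical(RELATIONS[name]) else 2


# sorted() is stable, so relations within a tier keep their listed order.
order = sorted(RELATIONS, key=tier)
print(order)
# ['equivalence', 'overlap', 'disjoint', 'set-superset', 'set-subset']
```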

Errors during the comparison stage arise from simplifying heuristics
subjects use to facilitate selection of a label for combined pairs of
representations. If no label is consistent with all of the combined set
relations generated during combination, the subject labels the relationship
between A and C indeterminate, choosing "None of the above" as an answer. If
only one label is correct, then the subject chooses that one. But sometimes two
labels are consistent with the representation generated during the combination
stage. For example, the final set relation A subset of C can be represented
either as All A are C or as Some A are C. In this case, some basis is needed
for choosing between labels.

Whenever two labels are consistent with all set relations generated during
the combination stage, one of these labels will be stronger than the other, and
one of the labels (but not the other) will match the atmosphere of the premises.
The stronger of two labels is the label with fewer possible set relations in
its representation. For example, All A are C is stronger than Some A are C,
because the universal statement can be represented by only two set relations
(equivalence and set-superset), whereas the particular statement can be
represented by four set relations (equivalence, set-superset, set-subset, set
overlap). The atmosphere of two premises is determined by the standard rules:
It is particular (leading to the choice of a particular conclusion) if at
least one premise is particular, and negative (leading to the choice of a
negative conclusion) if at least one premise is negative.
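Both criteria are easy to state in code. The following sketch assumes a
hypothetical Statement record that stores a statement's quantification,
polarity, and number of possible set relations; the counts for "All" (two),
"Some" (four), and "No" (one) are those given in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Statement:
    particular: bool  # "Some ..." rather than "All ..." or "No ..."
    negative: bool    # "No ..." or "Some ... are not ..."
    n_relations: int  # number of set relations that can represent it


ALL = Statement(particular=False, negative=False, n_relations=2)
SOME = Statement(particular=True, negative=False, n_relations=4)
NO = Statement(particular=False, negative=True, n_relations=1)


def atmosphere(major, minor):
    """(particular, negative) atmosphere of a pair of premises."""
    return (major.particular or minor.particular,
            major.negative or minor.negative)


def matches_atmosphere(label, major, minor):
    return (label.particular, label.negative) == atmosphere(major, minor)


def stronger(label_a, label_b):
    """The label with fewer possible set relations."""
    return label_a if label_a.n_relations < label_b.n_relations else label_b


# With one universal and one particular affirmative premise, the particular
# label matches the atmosphere while the universal label is the stronger one.
print(matches_atmosphere(SOME, ALL, SOME), stronger(ALL, SOME) is ALL)  # True True
```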

The bases for choosing a label when two labels are possible take into
account strength and atmosphere of the premises. It may be that each of the
two possible labels meets one of the two criteria, or that one of the two
labels meets both. Suppose the former is true: Each label meets one criterion.
When one label is weaker than the other label, but matches the atmosphere of
the premises, it is chosen with probability β1, and the stronger label is
chosen with probability (1 - β1). Suppose the latter is true: One of the
two labels meets both criteria. When one label is both the stronger label
and matches the atmosphere of the premises, it is chosen with probability β2,
and the other label is chosen with probability (1 - β2).
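The choice rule can be written as a short function returning the probability
of each of the two candidate labels. This is a sketch under the assumption that
exactly one label is stronger and exactly one matches the atmosphere; the
function name and the numerical values of β1 and β2 used in the example are
hypothetical and are not estimates from the experiments.

```python
def label_choice_probs(a_is_stronger, a_matches_atmosphere, beta1, beta2):
    """Return (P(choose label A), P(choose label B)) for two consistent labels."""
    if a_is_stronger and a_matches_atmosphere:
        return beta2, 1 - beta2      # A meets both criteria
    if not a_is_stronger and not a_matches_atmosphere:
        return 1 - beta2, beta2      # B meets both criteria
    if a_matches_atmosphere:
        return beta1, 1 - beta1      # A is weaker but matches the atmosphere
    return 1 - beta1, beta1          # A is stronger, B matches the atmosphere


# Illustrative values only.
print(label_choice_probs(False, True, beta1=0.75, beta2=0.875))  # (0.75, 0.25)
print(label_choice_probs(True, True, beta1=0.75, beta2=0.875))   # (0.875, 0.125)
```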

There is one more source of error in the comparison stage. This arises
when the final set relations generated during the combination stage have
different initial components. In the example described in Figure 2, for
example, two of the pairs of components have a1 and a2 both linked to C, and
two have a1 linked to C but a2 linked to not C. In such cases, subjects are
hypothesized occasionally to mistake this discrepancy for an indication that
the conclusion is indeterminate: with probability c, the subject mistakenly
labels the relationship between A and C as indeterminate.

Alternative Theories of Categorical Syllogistic Reasoning

The constraints of space unfortunately permit only the briefest
description of the alternative information-processing models to which we
compared the transitive-chain information-processing model. Details can be
found in the original papers, and in two of our own papers (Guyote & Sternberg,
Note 1; Sternberg & Turner, Note 2).

In the transitive-chain model, errors occur during combination and
comparison, but not during encoding. In the complete-combination model of
Erickson (1974), errors occur during encoding and comparison but not during
combination. In the random combination model of Erickson (1974), errors occur
during encoding, combination, and comparison. The atmosphere model of Woodworth
and Sells (1935) is essentially one of alogical information processing.
Subjects encode, combine, and compare only the quantification (universal or
particular) and polarity (affirmative or negative) of the premises. And in the
conversion model of Chapman and Chapman (1959), errors in syllogistic
reasoning, due to conversion of premises, occur during the encoding and
comparison stages of processing.

The numbers of parameters estimated differed widely across models, an
inevitable consequence of the different information-processing assumptions the
models make. Thus, the transitive-chain model involved estimation of seven
free parameters, the complete and random combination models involved estimation
of thirteen free parameters apiece, and the atmosphere and conversion models
each involved estimation of one free parameter. We were not particularly
concerned with the differing numbers of parameters, however, for three reasons.
First, our major concern was with comparing the historically important models
in a way that did full justice to the initial conceptualizations, and these
conceptualizations differ widely in their complexity and completeness. Second,
we always fit the models to large numbers of data points (at least 100) in
comparing models, thus minimizing the opportunity for capitalization upon
chance variation in the data. Third, the fits of the models showed little
correspondence to numbers of parameters in the models, suggesting that number
of parameters was not an important determinant of fit.

Empirical Tests of the Models

Method. Three experiments relevant to distinguishing the theories noted
above were conducted with Yale undergraduates.

In a first experiment, subjects received pairs of premises with abstract
content, and had to choose the best of five possible conclusions, for example,
"All B are C. All A are B. (a) All A are C. (b) No A are C. (c) Some A
are C. (d) Some A are not C. (e) None of the above." Half of 38 syllogisms
had at least one valid conclusion from among options (a) to (d); the other half
did not. Each of 49 subjects received all of the syllogisms.

In a second experiment, subjects received pairs of premises with concrete
content, and again had to choose the best of five possible conclusions. Content
could be either factual, for example, "No cottages are skyscrapers. All
skyscrapers are buildings;" counterfactual, for example, "No milk cartons are
containers. All containers are trash cans;" or anomalous, for example, "No
headphones are planets. All planets are frying pans." Note that anomalous
premises could be either factually correct (as was the major premise of the
example) or incorrect (as was the minor premise of the example): In either
case, though, the subject and predicate of the premise were semantically
unrelated (or close to it). Each of 20 syllogism types was presented to each of
50 subjects once with each type of content. Items were not blocked by content
type. Subjects in this experiment were given the verbal reasoning, spatial
visualization, and abstract reasoning tests of the Differential Aptitude Test.
The tests were subjected to a principal-components analysis, yielding two
orthogonal components, a verbal one and a spatial-abstract one.

In a third experiment, premises were again presented with abstract
content. This experiment differed from the first experiment, however, in that
(a) the subject's task was to indicate whether a single presented conclusion
was definitely, possibly, or never true, and (b) subjects might receive either
a single premise or a pair of premises. Sixteen subjects received each of four
premises, such as "All A are B," with each of four possible conclusions, such
as "Some A are not B," and had to determine the truth value of each conclusion;
sixteen other subjects received 15 pairs of premises such as "All B are C. All
A are B," and had to determine the truth value of the conclusion. Eleven premise
pairs had at least one valid conclusion; four did not. This decomposition of the
task permitted us to test assumptions of the models regarding encoding of single
premises separately from assumptions of the models regarding combination of pairs
of premises.

Results. Three sets of results are of primary interest: fits of the
models to the data, parameter estimates for the preferred model, and
relationships of parameter estimates to ability test scores.

Fits of the alternative models of categorical syllogistic reasoning to the
response-choice data are shown in Table 1. Model fits are expressed in terms of
proportion of variance in the data accounted for by each model (R²), and of
root-mean-square deviation of observed from predicted values (RMSD).

Insert Table 1 about here

The results of the experiments, considered either singly or as a whole,
are unequivocal: The transitive-chain model gave a better account of the
response-choice data than did any competing model. And the results of the
third experiment show that the assumptions of the transitive-chain model are
plausible both for encoding considered alone and for encoding and combination
considered jointly. Viewed by itself without regard to the other models, the
transitive-chain model also did very well: R² was greater than .9 for all
but one data set (in which it was .89).
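The two fit indices can be computed as sketched below. The text does not say
how R² was calculated; this sketch assumes the squared Pearson correlation
between observed and predicted values, and the response proportions in the
example are invented for illustration rather than taken from Table 1.

```python
import math


def rmsd(observed, predicted):
    """Root-mean-square deviation of observed from predicted values."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def r_squared(observed, predicted):
    """Squared Pearson correlation (one common reading of variance accounted for)."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov ** 2 / (var_o * var_p)


# Invented response proportions for four response options.
observed = [0.60, 0.10, 0.25, 0.05]
predicted = [0.55, 0.15, 0.20, 0.10]
print(round(r_squared(observed, predicted), 3), round(rmsd(observed, predicted), 3))
```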

Although the fits of the transitive-chain model to the data are most
respectable, it is important to note that the model could be rejected at the
.05 level or better in every case. Thus, although the transitive-chain model
is the best of the competing models, and shows respectable fits when considered
just on its own, it is not the true model. The most likely source of
inadequacy seemed to us to be the assumption that encoding is always complete
and correct. We therefore tried relaxing this assumption, estimating parameters
for errors in encoding. Generally, this bought us about .02 or .03 points of
R², and in about half of the data sets resulted in nonrejection of the model.
But the small increases in R² did not seem to justify the increase by over 50%
in the number of parameters, and so we did not modify the theory.

Table 2 shows values of parameter estimates in each of the various
experiments. Parameters p2, p3, and p4 were highly correlated, and were
therefore combined. In general, the parameter estimates make good sense.

Insert Table 2 about here

Consider first the p parameters. The value of p1 is particularly low for
syllogisms with factual content, suggesting that the working memory or other
processing limitations that restrict the number of set relations a subject can
combine are lessened when the subject is dealing with concrete, factual content.

The value of β1 is always considerably greater than .5, indicating that given
a choice between a stronger label and a label that matches the atmosphere of
the premises, subjects prefer the label that matches the atmosphere of the
premises. One would expect β2 to be quite close to 1, since it represents
subjects' preferences for conclusions that both are stronger and match the
atmosphere of the premises. In fact, β2 is quite close to 1 in each data set.
Finally, we can see that when the representation for combined premises
contains nonidentical first components, subjects did show pronounced tendencies
(indicated by nontrivial values of the c parameter) to label the final
representation as indeterminate.

We next consider the relationship between the parameters of the
transitive-chain model and scores on the orthogonal verbal and spatial-abstract
principal components. Means of parameter estimates in Experiment 2 were
calculated for subjects high (that is, above the median) and low (that is,
below the median) on the two components. The results are shown in Table 3, and
can be summarized briefly.

Insert Table 3 about here

High and low verbal subjects did not differ significantly on any of the
parameters. High and low spatial-abstract subjects, however, did differ
significantly on the p1 (and hence the combined p2+3+4) parameter. Thus,
subjects higher in spatial-abstract ability were better able to combine more
set relations, presumably because of their ability to visualize more
representations or the same representations more clearly than did the lower
spatial-abstract subjects. There was no reason to expect any differences in the
parameters of the comparison stage, and none occurred.

It is obviously not possible to describe here all the data analyses we
performed and presented in the original reports of our results. Worth noting,
however, is the fact that we formulated a response latency model from the
assumptions of the transitive-chain theory, and tested it in Experiment 1, the
only one of the three experiments in which latency data were collected. The
model accounted for 80% of the variance in the latency data, indicating that
even with response latencies as long (mean = 43.59 sec) and as variable
(S = 5.34 sec) as those obtained for syllogism data, it is possible to obtain a
good fit of observed to predicted times.

Summary. To summarize, we have presented a new theory of categorical
syllogistic reasoning, the transitive-chain theory, which we have tested
on a variety of syllogism contents and response formats. The theory accounted
very well for the response-choice (and latency) data. The parameter estimates
were sensible and informative, and an analysis of individual differences in
parameter estimates shed some light on the kind of ability that may distinguish
good from poor deductive reasoners.

Conditional Syllogisms

The Nature of Conditional Syllogisms

A conditional syllogism comprises three declarative statements. The first
statement, called the major premise, expresses a relation between two events,
for example, "If A, then B." The second statement, called the minor premise,
asserts the truth or falsity of either the antecedent (first term) or consequent
(second term) of the major premise, for example, "Not B." The third statement,
or conclusion, is either the affirmation or negation of the term not appearing
in the minor premise, for example, "Not A."

An interesting parallel exists between conditional syllogisms and
categorical syllogisms of the second type, which were mentioned earlier but not
further discussed. Consider the syllogism, "All A are B. X is not a B.
(Therefore,) X is not an A." If A is taken to be the set of states of the world
in which event A is true, B is taken to be the set of states of the world in
which event B is true, and X is taken to be a particular state of the world,
then the conditional and categorical syllogisms become structurally isomorphic.
Indeed, the transitive-chain theory assumes that categorical syllogisms of the
second type are represented and processed in the same way as conditional
syllogisms.

The Transitive-Chain Theory of Conditional Syllogistic Reasoning

The transitive-chain theory applied to conditional syllogisms (and
categorical syllogisms of the second type) is very similar to the theory applied
to standard categorical syllogisms. First, the subject encodes both premises
completely, using the same format for storing information as was described
earlier. Then the subject attempts to construct a transitive chain involving
the representations of the second premise and one of the components in the
first premise. Because all major premises are universal in problems of these
types, there are a maximum of two possible representations of the first premise
(see Figure 1), and hence two sets of components. If the first rule for
constructing transitive chains that was described earlier permits formation of
a transitive chain, then the subject forms it and completes solution. If the
first rule does not apply, the subject has two choices. He or she can apply
the second rule, reason that no definite conclusion exists, and respond that
the given conclusion is logically invalid. Or the subject can use indirect
proof, trying to form a transitive chain integrating the negation of the
conclusion with one of the components in the representation of the major
premise. If such a transitive chain can be formed, and if the result
contradicts the representation of the second premise, the subject can respond
that the conclusion is valid. Otherwise, the conclusion is deemed invalid. The
probability of a subject's using indirect proof and thus being able to form a
second transitive chain (given that the first rule does not apply in the
subject's initial attempt to combine the two premises) depends upon the number
of negations in the first premise. Parameter t0 applies when there are no
negations in the first premise, t1 when there is one negation, and t2 when
there are two negations.
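The decision procedure just described can be summarized as a probability of
responding "valid." The sketch below is a simplification under stated
assumptions: when the first rule applies, the subject is assumed to respond
"valid" exactly when the chain yields the presented conclusion (the text says
only that the subject completes solution), and the function name, argument
names, and the t values in the example are hypothetical.

```python
def p_respond_valid(rule1_applies, chain_yields_conclusion,
                    indirect_proof_succeeds, n_negations, t):
    """Probability that the subject labels the conclusion valid.

    t is the triple (t0, t1, t2): the probability of using indirect proof
    when the major premise contains 0, 1, or 2 negations.
    """
    if rule1_applies:
        # A direct transitive chain settles the problem (assumed response rule).
        return 1.0 if chain_yields_conclusion else 0.0
    if indirect_proof_succeeds:
        # "Valid" only if the subject attempts indirect proof.
        return t[n_negations]
    # Rule 2, or an indirect proof that finds no contradiction: "invalid".
    return 0.0


t = (0.5, 0.25, 0.125)  # illustrative values only
print(p_respond_valid(False, False, True, n_negations=1, t=t))  # 0.25
```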

Empirical Tests of the Model

Method. An experiment was conducted with 50 adults from the New Haven
area. The stimuli were 64 syllogisms, half of which presented conditional
relations and half of which presented categorical relations isomorphic to the
conditional relations. The 32 syllogisms of each type were constructed
according to a 2^5 design that was exhaustive with respect to the possible item
types. In these syllogisms, (a) the first term of the major premise, (b)
the second term of the major premise, (c) the single term of the minor premise,
and (d) the conclusion were each either affirmative or negative, and (e)
the single term of the minor premise was the same (disregarding polarity) as
either the first or second term of the major premise. The subject's task was
to label each syllogism as having either a valid or an invalid conclusion. The
content in each syllogism was abstract (with the letters A and B used as terms).
All subjects received all syllogisms blocked by syllogism type, and also the
verbal reasoning, spatial visualization, and abstract reasoning sections of the
Differential Aptitude Test.

Results. Three sets of results are again of primary interest: fits of
the model to the data, parameter estimates for the model, and relationships
of parameter estimates to ability test scores. It is also of interest to note
that the correlation across the 32 item types for the two kinds of syllogisms
was .97, suggesting that the processes used to solve syllogisms of the two
types probably were quite similar, if not practically identical.

The transitive-chain model provided an excellent fit to the response-choice
data for both conditional and categorical syllogisms. For the conditional
problems, R² was .95 and RMSD was .10; for the categorical problems,
R² was .97 and RMSD was .07. As in the earlier experiments, however, the
fit of the model to each set of data could be rejected at the .05 level,
indicating that the transitive-chain model, although a close approximation to
the true model, is not identical to the true model.
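The two fit statistics reported here can be computed as in the sketch below.
It assumes that R² is the squared correlation between observed and predicted
values across item types and that RMSD is the root-mean-square deviation
between them; the function name and the sample numbers are hypothetical and
are not data from the experiment.

```python
import math


def fit_statistics(observed, predicted):
    """Return (R squared, RMSD) for paired observed and predicted values."""
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length sequences of length >= 2")
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Sums of cross-products and squared deviations about the means.
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        raise ValueError("R squared is undefined when a sequence is constant")
    r_squared = cov ** 2 / (var_o * var_p)
    rmsd = math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)
    return r_squared, rmsd


# Hypothetical response proportions for three item types.
r2, rmsd = fit_statistics([0.1, 0.5, 0.9], [0.2, 0.5, 0.8])
print(round(r2, 3), round(rmsd, 3))  # 1.0 0.082
```

The example shows why both statistics are reported: predictions can be
perfectly correlated with the data (R² of 1) and still miss in absolute
terms, which only RMSD reveals.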

Parameter estimates for the conditional syllogisms were .36 for p1, .64
for p2, .52 for t0, .48 for t1, and .15 for t2. (Parameters p3 and p4 are
irrelevant in this type of syllogism, because there are never more than two
possible set relations to combine; parameters β1, β2, and c are irrelevant,
because the presentation of only a single conclusion in this experiment
obviates the need for a comparison stage.) Parameter estimates for the
structurally isomorphic categorical syllogisms were .43 for p1, .57 for p2,
.60 for t0, .61 for t1, and .16 for t2.

Comparison of the value of p1 in this experiment with that of p1 in the
first experiment with categorical syllogisms of the first kind reveals that
with content type held constant, subjects combine more representations for
problems of the types used in this experiment than for problems of the type used
in that experiment. This result is a most sensible one, since the representation
of the minor premise in problems of the present types is simpler than the
representation of the minor premise in problems of the previous type. In the
present problems, the minor premise consists merely of a single term
(conditionals) or indication of set membership (categoricals), whereas in the
previous problems the minor premise consisted of a quantified relation between
two sets.

We assume that subjects have a fixed amount of processing capacity that
they can devote to each problem, and that increased consumption of processing
capacity for one kind of operation results in decreased processing capacity
left over for other kinds of operations. Using this reasoning, we had
expected the values of t0, t1, and t2 to be successively smaller: The
increased processing capacity allocated to comprehension of negations in the
major premise was expected to leave decreased processing capacity to allocate
to forming a second transitive chain from the negation of the conclusion.
Instead, the values of t0 and t1 were approximately equal, whereas the value
of t2 was indeed considerably lower. Apparently, double negations cause
considerably more difficulty for subjects relative to single negations than
do single negations relative to straightforward affirmations.

Next, we turn to comparison of parameter estimates for subjects high and low
on the orthogonal verbal and spatial-abstract principal component scores. Our
general expectation was that parameters reflecting processing capacity (those
relevant to the combination stage) would differ in value across ability
groups, whereas those parameters merely reflecting biases in response choice
(those relevant to the comparison stage) would not differ in value across
ability groups. Because the representation of information combined is assumed
to be symbolic, our particular expectation was that larger differences would
be obtained between the two spatial-abstract groupings than between the two
verbal groupings. The results of the previous experiment confirmed both the
general and specific expectations, and the results of the present experiment
do as well. As in the previously described experiment, the values of p1 for
high and low verbal subjects (.38 and .43) did not differ significantly; the
values of p1 for high and low spatial-abstract subjects (.35 and .52) did
differ significantly. Similarly, the values of the t parameters (which are
combination-stage parameters) did not differ significantly across high and
low verbal subjects (.54 and .55 for t0, .50 and .55 for t1, and .16 and .17
for t2); they did differ significantly across high and low spatial-abstract
subjects (.66 and .43 for t0, .63 and .42 for t1, and .22 and .11 for t2).
The results of both experiments thus confirm that (a) parameters
measuring processing capacity vary with spatial-abstract ability, whereas
parameters not measuring processing capacity do not vary with this ability,
and (b) no parameters vary with verbal ability. These results provide
further support for the kind of symbolic representation and for the
identification of processes proposed by the transitive-chain theory.

As in Experiment 1 for categorical syllogisms of the first type, a
response-latency model was formulated on the basis of the transitive-chain
theory. The values of R² for this model were .91 for conditional syllogisms
and .84 for categorical syllogisms of the second type. The model thus provides
a good fit to the latency data.

Summary. To summarize, we have presented an extension of the transitive-chain
theory to conditional syllogisms and to categorical syllogisms of the
second type. The two problems were proposed to be structurally isomorphic,
and the high correlation between response-choice data supports a claim of
psychological as well as structural isomorphism. The transitive-chain theory
accounted well for response-choice (and latency) data. The parameter estimates
again shed light on the ways in which subjects process information, and the
analysis of individual differences in parameter estimates provided indirect
support for representational and processing assumptions of the transitive-chain
theory.

Linear Syllogisms

The  Nature  of  Linear  Syllogisms 

A linear syllogism comprises two premises and a question. Each of the
premises describes a relation between two items, with one of the items
overlapping between the two premises. The subject's task is to use this overlap
to determine the relation between the two items not occurring in the same
premise. Determination of this relation enables the subject to answer the
question. In the linear syllogism, "C is not as tall as B; A is not as short
as B; Who is shortest?" the subject must determine that B is the overlapping
term, and that since B is shorter than A and B is taller than C, C is shorter
than A. Hence, C is shortest.
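The inference in this example can be sketched as a small ordering routine,
reading the overlapping term of the example as B. This is a bare logical
solver offered for illustration, not a model of how subjects proceed; the
function name and the pair encoding are assumptions.

```python
def order_terms(taller_than):
    """Order terms from tallest to shortest.

    taller_than: list of (x, y) pairs meaning 'x is taller than y'.
    Assumes the pairs describe a single consistent linear order.
    """
    terms = {term for pair in taller_than for term in pair}
    remaining = list(taller_than)
    ordered = []
    while terms:
        # The tallest remaining term never appears on the 'shorter' side.
        shorter = {y for _, y in remaining}
        top = sorted(terms - shorter)
        if len(top) != 1:
            raise ValueError("premises do not determine a single order")
        ordered.append(top[0])
        terms.remove(top[0])
        remaining = [(x, y) for x, y in remaining if x != top[0]]
    return ordered


# "C is not as tall as B" gives B > C; "A is not as short as B" gives A > B.
order = order_terms([("B", "C"), ("A", "B")])
print(order)      # ['A', 'B', 'C']
print(order[-1])  # C
```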

Whereas subjects show a rather wide range of responses in their solutions
to particular categorical and conditional syllogisms, they show little variation
in their response choices for linear syllogisms. In four experiments where
subjects were told to emphasize accuracy of response (Sternberg, Note 3), 99%
of the responses to the questions were correct. Hence, the priorities in
modeling linear-syllogism data are reversed from those of categorical and
conditional syllogisms. The primary goal is to model response latency, and the
secondary goal is to model errors.

A Mixture  Theory  of  Linear  Syllogistic  Reasoning 

Representation of Information. According to the proposed theory, two
types of representations are used in the solution of linear syllogisms (and
hence the name "mixture theory"). First, subjects are hypothesized to decode
the premises of the syllogism into a linguistically-based, deep-structural
proposition of the type originally proposed by Chomsky (1965). A premise such
as "John is taller than Mary," for example, would be represented as (John is
tall+; Mary is tall) (see Clark, 1969). Next, subjects are hypothesized to
recode the deep-structural representation into a spatial array that functions
as an internal analogue to a physically realizable array. In such an array,
John would be placed above Mary.

According to the mixture theory (as proposed by Sternberg, Note 3, and
modified by Sternberg, Note ^), as many as 10 component processes may be
required to solve linear syllogisms of various kinds. These processes will
be illustrated with reference to the example problem cited above (C is not
as tall as B; A is not as short as B; Who is shortest?).

1. Premise reading (mandatory). The subject reads each of the two premises,
"C is not as tall as B" and "A is not as short as B," comprehending their
surface structure.

2. Linguistic decoding of comparative relation (mandatory). The subject
decodes the surface-structural form into a deep-structure proposition relating
the two terms of the premise. Decoding of a premise with a marked adjective
(such as short) is assumed to take longer than decoding of a premise with an
unmarked adjective (such as tall). In the example, the first premise is
decoded into the form (C is tall+; B is tall); the second premise is decoded
into the form (A is short+; B is short). Note that at this point, only the
comparative and not the negative has been processed, so that the deep-structural
propositions do not accurately represent the content of the premises.

3. Decoding of negation (optional). If a premise is a negative equative,
that is, one with the relation "not as ___ as," it is necessary to reformulate
the deep-structural decoding of the premise to take the negation into account.
The roles of the terms in the propositions are reversed, so that the first
proposition becomes (B is tall+; C is tall) and the second one becomes (B is
short+; A is short).

4. Spatial seriation of comparative relation (mandatory). Having decoded
the premises into deep-structural propositions, the subject is now able to
seriate the terms of each premise spatially. A propositional encoding is
assumed to be prerequisite for spatial seriation. The subject may seriate
the two terms of each premise in either a preferred (usually top-down) or
nonpreferred (usually bottom-up) direction. It is assumed that the subject's
choice of direction depends upon whether or not the adjective in the original
premise was marked. The preferred direction is used for unmarked adjectives,
the nonpreferred direction for marked adjectives. In the example,
B and C are seriated top-down into one spatial array (B above C). B and A are
seriated bottom-up into a second spatial array (A above B).

5. Pivot search (optional). Once the subject has seriated the terms
in each of the two premises into two spatial arrays, the subject must locate
the middle (pivot) term that will enable him or her to combine the two
arrays into a single array. The pivot is assumed to be immediately available if
either (a) it appears in two affirmative premises, or (b) it was the last
term to be seriated in a negative equative. (The principles behind this
availability are described in Sternberg, Note 3.) In the example, the last term
to have been seriated was A (the tallest term). The subject inquires whether A
is the pivot. Since it is not, the subject must use additional time locating the
pivot, which is the only term that appears in both premises.

6. Seriation of the two arrays into a single array (mandatory). Having
found the pivot, the subject is prepared to combine the two separate arrays
into a single, integrated spatial array. The subject combines the two single
arrays according to the order of the original premises. Combination of these
arrays is assumed to be less susceptible to error (although not less
time-consuming) if the first term to be combined (which is always the first
term in the final deep-structural proposition describing the first premise) is
the term that is most current in working memory, namely, the pivot (from the
immediately preceding operation 5). In the example, the subject starts
seriation with the B term as encoded from the bottom half of the array, and
ends up in the top half of the array. Thus, the subject links the second pair
of terms, A and B, to the first pair, C and B, forming the spatial array
A, B, C (from top to bottom).

7. Question reading (mandatory). Next, the subject must read the question
that he or she will be required to answer. If the question contains a
marked adjective, as does the question in the example, it is assumed to take
longer to decode, and the subject is assumed to have to search for the response
to the question in the nonpreferred end of the array. A marked adjective in
the question, therefore, increases response latency. The example question,
"Who is shortest?", contains such an adjective.

8. Response search (optional). After seriation was completed (Operation
6), the "mind's eye" of the subject ended up either in the top or bottom half of
the spatial array. If the question has as its answer the term that is in the
half of the array in which the subject's mind's eye ended up, then the response
is immediately available. If the answer term is in the other half of the array,
however, then the response is not available and must be sought. This search
requires additional time. In the example, the subject ended up in the top
half of the array, completing seriation with the A and B terms. The question,
however, asks who is shortest. The subject must, therefore, search for the
response, finding it in the bottom half of the array.

9. Establishment of congruence (optional). The processes described
above are sufficient to establish a correct answer, and under some circumstances,
a response is immediately forthcoming. If, however, subjects wish to check the
accuracy of the response obtained by interrogation of their spatial array, they
have available to them their propositional representation by which they can
verify their response. If the linguistic encoding of the proposed response is
congruent with the linguistic encoding of the corresponding term of the
proposition, then the response immediately passes the congruence check. If the
two are incongruent, however, congruence of the response term to the
propositional term is established, taking additional time. In the example, C,
the shortest term, was described as tall (relative to B, which was tall+). The
question, however, asks who is shortest. Congruence must therefore be
established by formulating the question in terms of who is least tall.

10. Response (mandatory). The final operation is response, whereby
the subject communicates his or her choice of an answer. In the example,
the subject responds with C.
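Operations 2 through 4 can be made concrete with a short sketch that decodes a
premise into a proposition, reverses the roles of its terms under negation, and
places the terms in a two-item array. The class and function names are
assumptions introduced for illustration, and the example reads the overlapping
term as B; the sketch traces the representations only and says nothing about
processing time.

```python
from typing import NamedTuple

MARKED = {"short"}  # 'short' is the marked member of the tall-short pair


class Proposition(NamedTuple):
    more: str        # term carrying the adjective with '+'
    less: str        # term carrying the bare adjective
    adjective: str


def decode(first, second, adjective, negated):
    """Operations 2 and 3: decode the comparative, then apply any negation."""
    prop = Proposition(first, second, adjective)             # operation 2
    if negated:                                              # operation 3
        prop = Proposition(prop.less, prop.more, adjective)  # reverse roles
    return prop


def seriate(prop):
    """Operation 4: return ((top, bottom), direction) for one premise."""
    if prop.adjective in MARKED:
        # 'More short' is lower in the array; marked adjectives are
        # seriated in the nonpreferred (bottom-up) direction.
        return (prop.less, prop.more), "bottom-up"
    return (prop.more, prop.less), "top-down"


first = decode("C", "B", "tall", negated=True)    # C is not as tall as B
second = decode("A", "B", "short", negated=True)  # A is not as short as B
print(seriate(first))   # (('B', 'C'), 'top-down')
print(seriate(second))  # (('A', 'B'), 'bottom-up')
```

The two arrays share the term B, which is the pivot located in operation 5 and
used to join them in operation 6.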

Alternative  Theories  of  Linear  Syllogistic  Reasoning 

The mixture theory was compared to two other theories of linear syllogistic
reasoning, a spatial theory based upon the theories of DeSoto, London, and
Handel (1965) and Huttenlocher (Huttenlocher, 1968; Huttenlocher & Higgins, 1971),
and a linguistic theory based upon the theory of Clark (1969). Although the
alternative theories as formulated here were based upon previous theories, they
were not identical to them. The alternative theories were not specified in
a form sufficiently rigorous to permit quantification, and in order to permit
precise comparison of theories, additional assumptions had to be made that did
enable quantification. Although the alternative theories as presently formulated
are not identical to the previous theories, they do seem to capture many of the
major intuitions of these previous theories.

The theories to be compared all agree that there are certain encoding,
negation, marking, and response operations that contribute to the latency with
which a subject solves a linear syllogism. All linear syllogisms contain certain
terms and relations to be encoded, and require a response. Only some linear
syllogisms contain premises with negations and marked adjectives. Although
the theories agree on the presence of these operations, they disagree as to
which of the operations are spatial and which are linguistic. The theories also
disagree as to what further operations are required. This divergence is
particularly important, since it provides the basis for distinguishing among
theories. Because the theories are partially nonoverlapping in the operations
alleged to be used in solving linear syllogisms, the theories make different
latency predictions across item types.

Under  certain  circumstances  (described  in  Sternberg,  Note  3),  the  mixture 
theory  has  one  more  parameter  (seven)  than  do  the  spatial  and  linguistic  theories 
(six).  As  will  be  shown,  however,  the  presence  of  the  additional  parameter 
(the  optional  parameter  representing  the  time  to  establish  congruence)  never 
changes  the  rank  order  of  the  model  fits  to  the  latency  data. 

Empirical  Tests  of  the  Theories 

Method. Five experiments were conducted with college undergraduates that
were designed to distinguish among the mixture, spatial, and linguistic theories.
All of the experiments involved presentation of 32 basic types of linear
syllogisms with three different adjective pairs (usually taller-shorter,
better-worse, and faster-slower). The length of the experiments ranged from one
to three sessions, and all experiments included administration to each subject
of tests of verbal reasoning, spatial visualization, and abstract reasoning
abilities.

In the first experiment, 16 Stanford undergraduates received linear
syllogisms such as "Sam is taller than Joe. Joe is taller than Bob. Who is
tallest? Joe Bob Sam." Items were presented to all subjects in both of two
cueing conditions. In the first condition, subjects received a blank field in
the first part of a trial. Subjects indicated readiness to see the item by
pressing a foot pedal, and following this indication of readiness the entire
item appeared on a tachistoscope screen. In the second condition, subjects
received the two premises of the syllogism in the first part of the trial.
Subjects processed the premises as fully as they could, and then pressed the
foot pedal, resulting in the appearance of the entire item on the screen.

In the second experiment, the linear syllogisms were presented to 18 Yale
undergraduates with the question first: "Who is tallest? Sam is taller than Joe.
Joe is taller than Bob. Joe Sam Bob." In this experiment, there were three
rather than two precueing conditions. Subjects received either a blank field,
just the question, or the question and the two premises in the first part of
the trial. They always received the whole item in the second part of the trial.

In the third experiment, the linear syllogisms (with question last) were
presented to 18 Yale undergraduates without precueing. However, subjects also
received eight basic types of two-term series problems, for example, "Jim is
taller than Bob. Who is tallest? Jim Bob."

The fourth experiment was similar to the third experiment, except that
each of the 54 Yale undergraduates participating received two-term series problems
and linear syllogisms (which are also known as three-term series problems) with
just one of the three adjective pairs, rather than with all three as in the
previous experiments.

The fifth experiment was also similar to the third experiment, except that
the 18 Yale summer session students were encouraged to solve items rapidly, and
a bonus was paid to encourage more rapid (and hence less accurate) performance.
The speed-accuracy tradeoff manipulation proved to be successful: Mean solution
latencies decreased by about a second (from approximately seven to approximately
six seconds), and mean error rates increased from 1% in the previous experiments
to 7% in this experiment.

Results. As in the previous analyses, we shall be concerned with fits
of the quantified models to the data, parameter estimates, and relations
between parameter estimates and ability test scores. Because of space
limitations, we shall present only model fits for the zero-cue condition (blank
field in the first part of the trial).

Insert Table 4 about here

Table 4 presents model fits (in terms of R²) for the latency data from
each of the five experiments. In each experiment, the mixture theory is clearly
superior to either the linguistic or spatial theory: The differences in R²
between the mixture theory and the second best theory (the linguistic theory in
four of the five experiments) were .213, .148, .155, .240, and .237 in
Experiments 1, 2, 3, 4, and 5 respectively. Thus, regardless of whether the
question came before or after the premises, of whether or not precueing was
part of the experimental design, of whether different adjectives were presented
within or between subjects, and of whether subjects emphasized speed or accuracy,
the mixture theory best accounted for the data. The optional parameter for
establishment of congruence was relevant to performance in Experiments 3, 4, and
5. With this parameter deleted, the values of R² for the mixture theory were
.765, .832, and .761 in Experiments 3, 4, and 5 respectively. Thus, even
without the optional parameter, the mixture theory was clearly superior to its
competitors. Moreover, this superiority held up in every comparison: for every
adjective, for every session, and with precueing conditions included in the
analysis. It should be noted, though, that the mixture theory could be rejected
relative to the true theory in all but the first experiment: The unexplained
variance was statistically significant in four of the five experiments. Thus,
the mixture
theory, although the best available approximation to the true theory, is not
identical to it.

Insert Table 5 about here

Table 5 presents parameter estimates for the various operations that could
be separated in each of the five experiments. The ENC+ (encoding plus)
parameter includes a combination of times for between-premise seriation,
incremental seriation of marked adjectives in the nonpreferred direction,
premise reading, and encoding of unmarked adjectives. The first two processes
are hypothesized to be spatial, and to account for most of the estimated time.
The second two processes are hypothesized to be linguistic. ENC+1 differed
significantly from 0 in both experiments in which it was estimated (1 and 2),
and was estimated at about 4650 msec. ENC+2, comprising slightly fewer
operations, was estimated at about 3050 msec in the experiments with standard
speed-accuracy tradeoff. It seems unlikely that the small difference in the
composition of ENC+1 and ENC+2 (see Sternberg, Note 3) could account for the
large difference in estimated values. Rather, it seems more likely that
encoding operations were performed more rapidly in Experiments 3 and 4, where
ENC+2 was estimated (under standard speed-accuracy tradeoff), than in
Experiments 1 and 2, where ENC+1 was estimated (also under standard
speed-accuracy tradeoff). The obtained difference is exactly as predicted by
the mixture theory, according to which encoding should be more rapid and less
careful in experimental paradigms leading to the use of the optional operation
for the establishment of congruence. In Experiments 1 and 2, the use of
precueing presumably encouraged subjects to encode the premises fully before
indicating readiness to see the question and solve the problem. In Experiments
3 and 4, there was no precueing in which subjects could take as long as they
needed to get a sharp spatial encoding. Hence, subjects are likely to have
encoded the items more quickly and less sharply, at the expense
of needing the extra check for congruence at the end.

It was possible to estimate unconfounded durations of negation, marking,
pivot search, and response search times in all five experiments. Estimates
(for standard speed-accuracy tradeoff) of negation time center around 350 msec,
of marking time around 400 msec, of pivot search time around 1100 msec, and
of response search time around 500 msec. Question reading time (plus
confounded operations) could be estimated only in the second experiment, and
appears to be about 400 msec. Response time is about 800 msec.
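To show how such estimates could combine into a predicted solution latency,
the sketch below simply sums the durations of the operations an item is taken
to require. Additivity, the use of simple counts, and the function and
dictionary names are assumptions made here for illustration; the actual
quantification is given in Sternberg (Note 3). The durations are the rounded
values quoted in the text, with the ENC+2 estimate standing in for encoding.

```python
# Approximate durations in msec, rounded from the estimates quoted in the
# text for standard speed-accuracy tradeoff (ENC+ uses the ENC+2 value).
DURATIONS = {
    "ENC+": 3050,  # confounded encoding operations
    "NEG": 350,    # decoding of one negation
    "MARK": 400,   # one marked adjective
    "PSM": 1100,   # pivot search, when the pivot is not already available
    "RS": 500,     # response search, when the answer is not available
    "RES+": 800,   # response
}


def predicted_latency(n_negations, n_marked, pivot_search, response_search,
                      durations=DURATIONS):
    """Sum the durations of the operations an item is assumed to require."""
    total = durations["ENC+"] + durations["RES+"]  # always required
    total += n_negations * durations["NEG"]
    total += n_marked * durations["MARK"]
    total += durations["PSM"] if pivot_search else 0
    total += durations["RS"] if response_search else 0
    return total


# Hypothetical item: two negations, one marked adjective, both searches.
print(predicted_latency(2, 1, True, True))    # 6550
# Hypothetical item with no optional operations.
print(predicted_latency(0, 0, False, False))  # 3850
```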

For the most part, the group parameter estimates are reasonable and in
close agreement across data sets. The two exceptions (for standard
speed-accuracy tradeoff) are that negation time was inexplicably low in
Experiment 3, and response search time was inexplicably low in Experiment 1.

Correlations between parameter estimates and composite ability test scores
for the first four experiments are shown in Table 6. Data from the fifth
experiment were excluded because the unique speed-accuracy tradeoff in this
experiment rendered parameter estimates noncomparable to parameter estimates in
the previous experiments.

Insert Table 6 about here

The encoding parameter (ENC+) was significantly correlated with scores
on all three types of ability tests. This pattern is consistent with the
mixture theory, according to which the ENC+ parameter includes both linguistic
and spatial-abstract processes. A strictly linguistic or spatial theory would
have difficulty accounting for this pattern. Although ENC+ contains a mixture
of operations, the predominant operation, according to the mixture theory, is
spatial seriation between premises. The mixture theory therefore predicted that
the spatial-abstract correlations would be higher than the verbal correlations,
and this was in fact the case.

The negation parameter (NEG) showed significant correlations with the
spatial and abstract composite scores but not with the verbal composite. This
pattern of correlations was inconsistent with the prediction of the mixture
theory, according to which negation was supposed to be a linguistic operation.
It now appears that negation is accomplished spatially by reversing the
positions of the two relevant terms in a within-premise spatial array.

The marking parameter (MARK) showed some relationship to all three
composite ability scores, as predicted by the mixture theory but by neither the
spatial nor the linguistic theory. It thus appears that marked adjectives are
both linguistically more difficult to encode and spatially more difficult to
seriate in an array.

Pivot search (PSM) was significantly correlated with the spatial and
abstract composites but not with the verbal composite. This pattern of
correlations was consistent with the mixture theory, according to which pivot
search is a spatial-abstract operation.

Response search (RS) was significantly correlated with all three composite
scores. The significant correlation with verbal ability came as a surprise,
since response search is postulated by the mixture theory to be a spatial
operation. A possible explanation of the correlation with the verbal composite
is that subjects may differ in the rates at which they read off names from a
spatial array, resulting in individual differences along a verbal dimension.

Search for congruence (NCON) was significantly correlated with the verbal
composite, but with neither the spatial nor the abstract composite. This
correlational pattern is as predicted by the mixture theory, which, like the
linguistic theory, postulates that the search for congruence is a linguistic
operation.

Finally, response (RES+) was significantly correlated with the verbal
composite but not with either the spatial or abstract composite. Response was
a confounded parameter containing mostly linguistic operations (see Sternberg,
Note 3), and hence this pattern of correlations was consistent with the theory.

Generally speaking, the results of the individual-difference analysis were
consistent with the predictions of the mixture theory, according to which
particular operations should show patterns of individual differences along
either verbal, spatial-abstract, or both lines. The two exceptions to the
predictions suggest a need for slight reconceptualization, which in the present
analysis was of necessity ad hoc.

Error rates in the first four experiments were too low to permit analysis.
A detailed analysis of error rates in the fifth experiment is presented
elsewhere (Sternberg, Note 4), but will not be discussed here.

Summary. The results of five experiments provide strong support for
the mixture theory, considered either by itself or in comparison to
alternative theories of linear syllogistic reasoning. Parameter estimates for
the mixture theory were sensible and generally consistent across experiments,
and patterns of individual differences generally supported predictions as to
which operations were spatial and which linguistic.

Conclusions  and  Current  Directions 

The  transitive-chain  and  mixture  theories  provide  plausible  and  empirically 
sound  accounts  of  reasoning  with  three  kinds  of  syllogisms.  Although  neither 
theory  is  "true"  in  the  sense  of  accounting  for  all  reliable  variance  in  the 
data,  each  theory  is  superior  to  any  of  the  currently  available  competitors. 
Thus,  each  theory  has  an  interesting  story  to  tell,  but  neither  story  is  the 
final  one.  These  theories  will  presumably  go  the  way  most  theories  have  in  the 
past,  and  eventually  be  replaced  by  better  theories. 

Our present research is following three principal directions, the first
of which is an attempt to show that the transitive-chain and mixture theories
are both special cases of a more general theory of deductive reasoning, the
second of which is an attempt to extend the theories to prose processing, and
the third of which is an analysis of the development of deductive reasoning.

We propose that the transitive-chain and mixture theories are both
special cases of a general theory of deduction. We are currently studying two
tasks that we believe integrate information processes from the two theories.
In both tasks, subjects receive two premises such as "All gleebs are taller
than some fricks. Some fricks are taller than all quirps." Note that these
premises resemble categorical syllogisms in the use of the quantifiers all
and some, but resemble linear syllogisms in the use of linear relational
orderings. As with both categorical and linear syllogisms, some of the items
involve negations and others do not. Subjects participating in two experiments
have to perform one of two tasks. In one task, subjects must answer
a question such as "Which are tallest? All gleebs. Some gleebs. All fricks.
Some fricks. All quirps. Some quirps. Can't tell." This task is similar to
the one subjects confront in solving linear syllogisms. The other task presents
four conclusions and the possibility of an indeterminacy: "All gleebs are
taller than all quirps. All gleebs are taller than some quirps. Some gleebs
are taller than all quirps. Some gleebs are taller than some quirps. Can't
tell." Subjects choose the best conclusion. This task is similar to the one
subjects confront in solving categorical syllogisms. We expect that an
account of the data from the two tasks will require a generalization including
components of both the transitive-chain and mixture theories.

Subjects may reason quite differently when presented with syllogisms
in the format of reasoning problems from the way they do when presented with
implicit syllogisms embedded in their everyday reading. For this reason, we
are investigating subjects' strategies for solving syllogisms when the
syllogisms are presented implicitly in the context of articles such as would
be found in newspapers or magazines, and when the questions requiring solution
of the syllogisms are embedded in the midst of other, more straightforward
reading comprehension questions.

Finally, three experiments are underway that investigate the development
of categorical, conditional, and linear syllogistic reasoning. Our goal in
these experiments is to determine what it is that develops with time. The
studies investigate cognitive development within the componential framework
outlined in previous work (Sternberg, 1977a, 1977b, in press; Sternberg &
Rifkin, Note 5).

Neither the experiments we have done to date, nor those currently planned,
exhaust the problem domain of deductive reasoning. We believe, however, that
we have made a good, if modest, start toward an understanding of the
representations and processes subjects use in solving a variety of deduction
problems.

1. The research summarized here is presented in detail in Guyote and
Sternberg (Note 1) and in Sternberg and Turner (Note 2).

2. The combination process is actually somewhat more complex than can be
described here. See Guyote and Sternberg (Note 1) for details.

3. Thirty-two other subjects in this experiment received a slightly
different task. See Sternberg and Turner (Note 2) for details.

4. The research summarized here is presented in detail in Guyote and
Sternberg (Note 1).

5. The research summarized here is presented in detail in Sternberg (Note 3,
Note 4).

6. The differential difficulty of problems in which the pivot is or is not the
term current in working memory was previously referred to as linguistic pivot
search (Sternberg, Note 3, Note 4).

7. The precise circumstances under which the optional operation for
establishing congruence is used are described in Sternberg (Note 3).

8. The number of parameters is fewer than the number of component processes
because of experimental confoundings of some of the operations.
Table 1

Performance of Models in Predicting
Response-Choice Data for Categorical Syllogisms of the First Type

                                       Model
Exper-   Transitive    Complete      Random
iment    Chain         Combination   Combination   Atmosphere    Conversion
         R²    RMSD    R²    RMSD    R²    RMSD    R²    RMSD    R²    RMSD

1        .97   .05     .77   .15     .57   .18     .40   .23     .43   .22
2F       .91   .08     .55   .19     .24   .24     .29   .26     .36   .23
2C       .92   .08     .54   .19     .39   .22     .27   .27     .32   .25
2A       .89   .09     .50   .20     .28   .23     .28   .27     .31   .25
6Eᵃ      .96   .19     .82   .41     .81   .43     .73   .41     .06   .19
6Cᵃ      .96   .16     .89   .28     .84   .33     .53   .51     .92   .26

Note: In Experiment 2, the suffixes F, C, and A refer to factual, counterfactual,
and anomalous syllogism contents respectively. In Experiment 6, the suffixes
E and C refer to encoding and combination tasks respectively.

ᵃModel fits were computed using parameter estimates from Experiment 1.
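
The two fit indices reported in Table 1 can be illustrated with a minimal sketch. It assumes that R² is the squared Pearson correlation between observed and predicted response proportions and that RMSD is the root-mean-square deviation between them; the exact computation is not spelled out in this report. The function name `fit_statistics` and the data values are hypothetical and serve only as illustration.

```python
import math


def fit_statistics(observed, predicted):
    """Return (R^2, RMSD) for paired observed and predicted values.

    Assumption: R^2 is the squared Pearson correlation and RMSD is the
    root-mean-square deviation; neither definition is taken from the report.
    """
    if len(observed) != len(predicted) or len(observed) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    # Sums of cross-products and squared deviations about the means.
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    if var_o == 0 or var_p == 0:
        raise ValueError("R^2 is undefined when either sequence is constant")
    r_squared = cov ** 2 / (var_o * var_p)
    rmsd = math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)
    return r_squared, rmsd


# Hypothetical response-choice proportions, for illustration only.
observed = [0.80, 0.10, 0.05, 0.05]
predicted = [0.75, 0.15, 0.05, 0.05]
r2, rmsd = fit_statistics(observed, predicted)
```

Under these definitions a model can have a high R² and still a sizable RMSD, since R² ignores differences in scale and location between predicted and observed values; this is one reason to report both indices.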




Table 2

Parameter Estimates for Transitive-Chain Model in Predicting
Response-Choice Data for Categorical Syllogisms of the First Type

Exper-                  Parameter
iment    p₁     p₂+p₃+p₄   β₁     β₂     c

1        .54    .46        .81    .92    .37
2F       .29    .71        .67    .95    .37
2C       .49    .51        .73    .94    .48
2A       .47    .53        .70    .92    .48

Note: In Experiment 2, the suffixes F, C, and A refer to factual, counterfactual,
and anomalous syllogism contents respectively. Parameter estimates for Experiment
1 were used for Experiment 3, and hence estimates for Experiment 3 are not shown.
Table 3

Parameter Estimates for Subjects High & Low in
Verbal & Spatial-Abstract Abilities:
Categorical Syllogisms of the First Type

                 Verbal         Spatial-Abstract
Parameter      High    Low       High    Low

p₁             .38     .43       .28     .53
p₂+p₃+p₄       .62     .57       .72     .47
β₁             .74     .73       .72     .75
β₂             .96     .97       .98     .95
c              .46     .45       .45     .46

Note: Parameter estimates are from Experiment 2.
Table 4

Performance of Models in Predicting
Latency Data for Linear Syllogisms:
Proportion of Variance Accounted For

Exper-             Model
iment    Mixture   Linguistic   Spatial

1        .81       .60          .57
2        .74       .59          .59
3        .84       .69          .58
4        .88       .64          .58
5        .84       .59          .61

Table 5

Parameter Estimates for Mixture Theory in Predicting
Latency Data for Linear Syllogisms

Exper-
iment   ENC+₁   ENC+₂   NEG   MARK   PSM    RS    NCON   QR+   RES+

1       4648    —       351   337    1136   380   —      —     —
2       4666    —       366   412    1045   695   —      393   836
3       —       2986    184   307    1154   522   538    —     —
4       —       3124    244   380    1008   656   396    —     —
5       1354            143   327    788    485   305          —

Note: Parameter estimates are expressed in milliseconds. All estimates
are for both cued and uncued data combined.
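
To show how millisecond estimates of this kind translate into a predicted solution latency, here is a minimal sketch. It assumes that predicted latency is the sum of component durations, each weighted by the number of times the component is executed for an item; this additive form is an assumption of the sketch rather than something stated in this section. The function name `predicted_latency` and the item's operation counts are hypothetical; the durations are the Experiment 3 row of Table 5.

```python
# Component durations in milliseconds, from the Experiment 3 row of Table 5.
EXPERIMENT_3_MS = {
    "ENC+2": 2986,
    "NEG": 184,
    "MARK": 307,
    "PSM": 1154,
    "RS": 522,
    "NCON": 538,
}


def predicted_latency(counts, durations=EXPERIMENT_3_MS):
    """Sum duration * count over the components an item requires.

    Assumption: components combine additively (hypothetical simplification).
    """
    unknown = set(counts) - set(durations)
    if unknown:
        raise KeyError(f"no duration estimate for: {sorted(unknown)}")
    return sum(durations[name] * n for name, n in counts.items())


# Hypothetical item: one encoding step, two marked adjectives, one pivot search.
item_counts = {"ENC+2": 1, "MARK": 2, "PSM": 1}
latency_ms = predicted_latency(item_counts)  # 2986 + 2 * 307 + 1154 = 4754
```

Items that require more operations, or slower ones such as pivot search, receive longer predicted latencies under this scheme.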
Table 6

Correlations between Parameter Estimates for Linear Syllogisms
and Composite Ability Scores

                   Composite Ability Score
Parameter      Verbal      Spatial     Abstract

ENC+           -.25**      -.51***     -.58***
NEG            -.14        -.34**      -.41***
MARK           -.20*       -.36***     -.38***
PSM            -.16        -.25**      -.35***
RS             -.26**      -.35***     -.34***
NCON           -.31*       -.24        -.22
RES+           -.30**      -.09        -.15

Note: Correlations are for data from Experiments 1-4 combined.

*p < .05
**p < .01
***p < .001


Figure Captions

1.  Symbolic representation of information in transitive-chain theory
of syllogistic reasoning.

2.  Inferential rules for transitive-chain theory of syllogistic reasoning,
with application to an example of combination of representations.

[Figure 1: for each set relation between A and B (Equivalence, Subset-Set,
Set-Subset, Overlap, Disjoint), the Symbolic Representation and the Euler
Diagram Representation.]

[Figure 2: INFERENTIAL RULES FOR TRANSITIVE-CHAIN THEORY OF SYLLOGISTIC REASONING

1. Match in Pivot Component:     x_i -> Y  &  y_j -> Z  ==>  x_i -> Z
2. Mismatch in Pivot Component:  x_i -> -Y  &  y_j -> Z  ==>  x_i -> Z or x_i -> -Z

APPLICATION OF INFERENTIAL RULES TO COMBINATION OF REPRESENTATIONS

Panels: Representation 1 (AB, BA) and Representation 2 (BC, CB);
Combine AB with BC; Combine CB with BA; Final Representations.]
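
The two inferential rules named in Figure 2 can be sketched in code. This is a minimal illustration on one reading of the figure, in which a match in the pivot component (x -> Y with y -> Z) yields x -> Z, and a mismatch (x -> -Y with y -> Z) leaves the outcome indeterminate between x -> Z and x -> -Z. The names `Component` and `combine` are hypothetical, and passing the sign of the second component through under a match is an assumption of the sketch.

```python
from typing import List, NamedTuple


class Component(NamedTuple):
    """One symbolic component, e.g. a1 -> B or a2 -> -B (hypothetical encoding)."""
    source: str    # lower-case subset label, e.g. "a1"
    negated: bool  # True when the component reads "-> -B"
    target: str    # upper-case set label, e.g. "B"


def combine(first: Component, second: Component) -> List[Component]:
    """Combine x -> (+/-)Y with y -> Z through the pivot set Y."""
    # The second component must start from a subset of the pivot set.
    if second.source[0].upper() != first.target:
        raise ValueError("components do not share a pivot set")
    if not first.negated:
        # Rule 1, match in pivot: x -> Y & y -> Z ==> x -> Z.
        # Assumption: the sign of the second component is carried over.
        return [Component(first.source, second.negated, second.target)]
    # Rule 2, mismatch in pivot: x -> -Y & y -> Z ==> x -> Z or x -> -Z.
    return [
        Component(first.source, False, second.target),
        Component(first.source, True, second.target),
    ]


# The AB-with-BC example: a1 -> B with b1 -> C, and a2 -> -B with b2 -> C.
match_result = combine(Component("a1", False, "B"), Component("b1", False, "C"))
mismatch_result = combine(Component("a2", True, "B"), Component("b2", False, "C"))
```

Here `match_result` holds the single component a1 -> C, whereas `mismatch_result` holds both a2 -> C and a2 -> -C, reflecting the indeterminacy introduced by a mismatch.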

Performance of Models in Predicting
Response-Choice Data for Categorical Syllogisms of the First Type

                                       Model
Exper-   Transitive-   Complete      Random
iment    Chain         Combination   Combination   Atmosphere    Conversion
         R²    RMSD    R²    RMSD    R²    RMSD    R²    RMSD    R²    RMSD

1        .97   .05     .76   .14     .59   .18     .57   .18     .78   .13
2F       .91   .08     .67   .16     .34   .21     .29   .23     .46   .20
2C       .92   .08     .69   .15           .18     .52   .18     .69   .15
2A       .89   .09     .66   .15     .53   .16     .46   .18     .61   .16
6E*      .96   .19     .86   .37     .86   .37     .73   .41     .96   .19
6C*      .96   .16     .89   .26     .84   .32     .53   .51     .92   .26

Note: In Experiment 2, the suffixes F, C, and A refer to factual, counterfactual,
and anomalous syllogism contents respectively. In Experiment 6, the suffixes
E and C refer to encoding and combination tasks respectively. Experiments 1 and
2 are from Guyote and Sternberg (Note 1). Experiment 6 is from Sternberg and
Turner (Note 2).

*Model fits were computed using parameter estimates from Experiment 1 of Guyote
and Sternberg (Note 1).